Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck

نویسندگان

چکیده

As an emerging type of user-generated content, micro-video drastically enriches people's entertainment experiences and social interactions. However, the popularity pattern individual still remains elusive among researchers. One major challenges is that potential a tends to fluctuate under impact various external factors, which makes it full uncertainties. In addition, since micro-videos are mainly uploaded by individuals lack professional techniques, multiple types noise could exist obscure useful information. this paper, we propose multimodal variational encoder-decoder (MMVED) framework for prediction tasks. MMVED learns stochastic Gaussian embedding informative its level while preserves inherent uncertainties simultaneously. Moreover, through optimization deep information bottleneck lower-bound (IBLBO), learned hidden representation shown be maximally expressive about target compressive in features. Furthermore, Bayesian product-of-experts principle applied encoder, where decision keeping or discarding made comprehensively with all available modalities. Extensive experiments conducted on public dataset collect from Xigua demonstrate effectiveness proposed framework.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Variational Information Bottleneck

We present a variational approximation to the information bottleneck of Tishby et al. (1999). This variational approach allows us to parameterize the information bottleneck model using a neural network and leverage the reparameterization trick for efficient training. We call this method “Deep Variational Information Bottleneck”, or Deep VIB. We show that models trained with the VIB objective ou...

متن کامل

Stochastic Variational Video Prediction

Predicting the future in real-world settings, particularly from raw sensory observations such as images, is exceptionally challenging. Real-world events can be stochastic and unpredictable, and the high dimensionality and complexity of natural images require the predictive model to build an intricate understanding of the natural world. Many existing methods tackle this problem by making simplif...

متن کامل

Relevant sparse codes with variational information bottleneck

In many applications, it is desirable to extract only the relevant aspects of data. A principled way to do this is the information bottleneck (IB) method, where one seeks a code that maximizes information about a ‘relevance’ variable, Y , while constraining the information encoded about the original data, X . Unfortunately however, the IB method is computationally demanding when data are high-d...

متن کامل

Towards Boosting Video Popularity via Tag Selection

Video content abounds on the Web. Although viewers may reach items via referrals, a large portion of the audience comes from keywordbased search. Consequently, the textual features of multimedia content (e.g., title, description, tags) will directly impact the view count of a particular item, and ultimately the advertisement-generated revenue. This study makes progress on the problem of automat...

متن کامل

Recurrent Neural Networks for Online Video Popularity Prediction

In this paper, we address the problem of popularity prediction of online videos shared in social media. We prove that this challenging task can be approached using recently proposed deep neural network architectures. We cast the popularity prediction problem as a classification task and we aim to solve it using only visual cues extracted from videos. To that end, we propose a new method based o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2023

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2021.3120537